Learning correspondences between visual features and functional features
نویسندگان
چکیده
We have implemented a visual learning system MIRACLE-IV , which is capable of obtaining an internai structure of an object from a series of silhouette images with no initial explicit models about the object[l, 2, 3]. The images are derived from only one object, but the forms of the object are varied. The system is composed of two sub-systems: a model-acquisition part (the modeler) and an imageprocessing strategy part (the strategist). On the assumption that the object consists of hinges, slides and solids, the modeler learns the number of them in the object and the relationship between them. The strategist binds the functional features as hinges or slides with visual features in the actual image data. The image-processing sequence for the extraction of the visual feature is not given previously but is learned automatically through trial and error. In our research, mutual references between pattern information and symbol description play essensial roles for learning. This paper describes how MIRACLEIV learns correspondence between fuctional features and visual features.
منابع مشابه
Cross-sensory correspondences and symbolism in spoken and written language.
Lexical sound symbolism in language appears to exploit the feature associations embedded in cross-sensory correspondences. For example, words incorporating relatively high acoustic frequencies (i.e., front/close rather than back/open vowels) are deemed more appropriate as names for concepts associated with brightness, lightness in weight, sharpness, smallness, speed, and thinness, because highe...
متن کاملLearning Visual Features from Large Weakly Supervised Data
Convolutional networks trained on large supervised dataset produce visual features which form the basis for the state-of-the-art in many computer-vision problems. Further improvements of these visual features will likely require even larger manually labeled data sets, which severely limits the pace at which progress can be made. In this paper, we explore the potential of leveraging massive, wea...
متن کاملLearning Deeply Supervised Visual Descriptors for Dense Monocular Reconstruction
Visual SLAM (Simultaneous Localization and Mapping) methods typically rely on handcrafted visual features or raw RGB values for establishing correspondences between images. These features, while suitable for sparse mapping, often lead to ambiguous matches at texture-less regions when performing dense reconstruction due to the aperture problem. In this work, we explore the use of learned feature...
متن کاملEstimation of metallurgical parameters of flotation process from froth visual features
The estimation of metallurgical parameters of flotation process from froth visual features is the ultimate goal of a machine vision based control system. In this study, a batch flotation system was operated under different process conditions and metallurgical parameters and froth image data were determined simultaneously. Algorithms have been developed for measuring textural and physical froth ...
متن کامل6D scan registration using depth-interpolated local image features
This paper describes a novel registration approach that is based on a combination of visual and 3D range information. To identify correspondences, local visual features obtained from images of a standard color camera are compared and the depth of matching features (and their position covariance) is determined from the range measurements of a 3D laser scanner. The matched depth-interpolated imag...
متن کامل